3D reconstruction based on hierarchical reinforcement learning with transferability

نویسندگان

چکیده

3D reconstruction is extremely important in CAD (computer-aided design)/CAE Engineering)/CAM manufacturing). For interpretability, reinforcement learning (RL) used to reconstruct shapes from images by a series of editing actions. However, typical applications RL for face problems. The search space will increase exponentially with the action due curse dimensionality, which leads low performance, especially complex spaces reconstruction. Additionally, most works involve training specific agent each shape class without related experiences others. Therefore, we present hierarchical approach transferability (HRLT3D). First, actions are grouped into macro that can be chosen top-agent. Second, task accordingly decomposed hierarchically simplified sub-tasks solved sub-agents. Different classical (HRL), propose sub-agent based on augmented state (ASS-Sub-Agent) replace set sub-agents, speed up process shared and having fewer parameters. Furthermore, ASS-Sub-Agent more easily transferred data other classes diverse states tasks. experimental results public dataset show proposed HRLT3D performs overwhelmingly better than recent baselines. More impressingly, experiments also demonstrate extreme our among different classes.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Tree Based Hierarchical Reinforcement Learning

In this thesis we investigate methods for speeding up automatic control algorithms. Specifically, we provide new abstraction techniques for Reinforcement Learning and Semi-Markov Decision Processes (SMDPs). We introduce the use of policies as temporally abstract actions. This is different from previous definitions of temporally abstract actions as we do not have termination criteria. We provide...

متن کامل

Hierarchical Memory-Based Reinforcement Learning

A key challenge for reinforcement learning is scaling up to large partially observable domains. In this paper, we show how a hierarchy of behaviors can be used to create and select among variable length short-term memories appropriate for a task. At higher levels in the hierarchy, the agent abstracts over lower-level details and looks back over a variable number of high-level decisions in time....

متن کامل

Hierarchical Explanation-Based Reinforcement Learning

Explanation-Based Reinforcement Learning (EBRL) was introduced by Dietterich and Flann as a way of combining the ability of Reinforcement Learning (RL) to learn optimal plans with the generalization ability of Explanation-Based Learning (EBL) (Di-etterich & Flann, 1995). We extend this work to domains where the agent must order and achieve a sequence of subgoals in an optimal fashion. Hierarchi...

متن کامل

Hierarchical Reinforcement Learning with Parameters

In this work we introduce and evaluate a model of Hierarchical Reinforcement Learning with Parameters. In the first stage we train agents to execute relatively simple actions like reaching or gripping. In the second stage we train a hierarchical manager to compose these actions to solve more complicated tasks. The manager may pass parameters to agents thus controlling details of undertaken acti...

متن کامل

Hierarchical Functional Concepts for Knowledge Transfer among Reinforcement Learning Agents

This article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for Reinforcement Learning agents. These definitions are used as a tool of knowledge transfer among agents. The agents are assumed to be heterogeneous; they have different state spaces but share a same dynamic, reward and action space. In other words, the agents are assumed t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Integrated Computer-aided Engineering

سال: 2023

ISSN: ['1875-8835', '1069-2509']

DOI: https://doi.org/10.3233/ica-230710